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This chapter provides an introduction to the use of diagrammatic language, or 
perhaps more accurately, diagrammatic calculus, in quantum information and quan¬ 
tum foundations. We illustrate the use of diagrammatic calculus in one particular 
case, namely the study of complementarity and non-locality, two fundamental con¬ 
cepts of quantum theory whose relationship we explore in later part of this chapter. 

The diagrammatic calculus that we are concerned with here is not merely an 
illustrative tool, but it has both (i) a conceptual physical backbone, which allows it 
to act as a foundation for diverse physical theories, and (ii) a genuine mathematical 
underpinning, permitting one to relate it to standard mathematical structures. 

(i) The conceptual physical backbone concerns compositionality. Given two sys¬ 
tems, there is also a composite system. This notion of composition is a primitive 
ingredient of the diagrammatic language. Moreover, the basic elements of the dia¬ 
grammatic language are processes, and states are identified with preparation pro¬ 
cesses. This paves the way for a framework of generalised compositional theo¬ 
ries (GCTs), named in analogy to generalised probabilistic theories [1], The latter 
have recently received much attention because one can better understand a theory— 
quantum theory in particular—by studying it as merely a member of a broader class 
of theories. Notably, the study of non-locality within this framework has provided 
important new insights [2, 3], Whereas generalised probabilistic theories discard 
everything except the convex probabilistic structure, in contrast, GCTs focus on 
composition. This approach is informed by techniques used in computer science, 
logic, and the branch of mathematics called category theory, however its roots can 
be traced to Schrodinger’s conviction that the essential characteristic of quantum 
theory is the manner in which systems compose [4], 
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(ii) On the other hand, the diagrammatic language has a well-defined mathemat¬ 
ical meaning, which permits any diagram to be interpreted as a definite object in 
various other concrete mathematical models, for example in Hilbert spaces. This 
translation can be carried out in a formally precise manner, so that reasoning in the 
diagrammatic calculus produces true equations in the chosen model. At the same 
time, the relationship between what is provable in the calculus and what is provable 
in concrete models can be described to a high degree of precision. 

We won’t discuss this mathematical basis in detail here, however it may be sum¬ 
marised as follows: the diagrammatic calculus is itself a GCT, and GCTs form a 
certain class of monoidal categories, also known as tensor categories. The use of 
diagrammatic languages for tensors traces back to Penrose in the early 1970’s [5], 
but was only placed on a formal mathematical basis in the late 1980’s [6, 7], Their 
use in quantum foundations and quantum information began with an abstract (par¬ 
tial) axiomatisation of Hilbert spaces in terms of these categories [8], eventually 
resulting in so-called quantum picturalism [9], Meanwhile, the diagrammatic com¬ 
positional language has been adopted by several researchers in quantum foundations 
[10, 11], The particular developments related here been used to solve problems in 
quantum foundations [12, 13] and quantum computation [14, 15, 16]. 


1 Introduction to Quantum Picturalism 
1.1 Theories and Diagrams 

A generalised compositional theory consists of systems, or more accurately types 
of systems, and processes which transform systems. A process / which transforms 
systems of type A into systems of type B is written f: A—¥ B. At the highest level of 
generality we do not need to give any details as to what A, B, or / are: it is enough to 
know that that / accepts systems of type A as inputs and produces systems of type 
B as outputs. The important thing is how systems and processes are combined. 

Mathematically speaking, general compositional theories are strict symmetric 
monoidal categories, and a full exposition of their properties would require a lengthy 
detour into category theory. The interested reader can refer to Mac Lane’s clas¬ 
sic text [17] for a thorough treatment. However, we can avoid reading Mac Lane’s 
book 1 by adopting a diagrammatic notation, which absorbs all of the relevant equa¬ 
tions into the syntax. This notation is the subject of the first section of this paper. 

We will represent processes by diagrams, consisting of boxes and wires. The 
wires are labelled by systems, and the boxes by basic processes 2 . Wires join boxes at 
the top and bottom; the wires below correspond to the input systems of the process. 


1 We jest; reading Mac Lane's book is eventually unavoidable, however the paper [18] is an easy 
introduction to the subject of monoidal categories. 

2 The term “basic” simply means a process whose internal structure is of no interest. Typically we 
construct diagrams from some given set of basic processes. 
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and those at the top correspond to the output systems. For example: 



f:A->B g:A®B^B®A S:A^A(g)A 

The same is true for the diagram as a whole: the wires entering the bottom of the 
diagram are its input systems, and those leaving from the top are its outputs. 

Given processes / : A —► B and g : B —> C, it seems obvious that doing / then 
g is again a process, and we write g o / : A —> C to denote this process. In other 
words, processes admit sequential combination ; we will usually call this operation 
composition. 

Similarly, a pair of systems, say A and B , can be taken together and viewed as 
a single system, A®B. Now, given a pair of processes / : A —> B and g : A' —y B ', 
a new process is obtained by placing them in parallel. We denote the combined 
process f ®g : A®A' —r B Q B'. This operation of parallel combination is called 
tensor. 

In the diagrammatic notation, composition is expressed by plugging the outputs 
of one box into the inputs of another, and the tensor is given by juxtaposition. 



We require that both operations, composition and tensor, are associative and obey 
the interchange law. 


(f®g)°(h®k) = (foh)<g>(gok). (1) 

In the graphical notation, all of these equations become trivial: they boil down the 
statement that the three diagrams below are unambiguous. 



While it is easy to translate these diagrams back into conventional notation, to do 
so we must make a choice of where to put the brackets, even though the theory 
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tells us this choice does not matter. This highlights a key advantage of working with 
diagrams, namely that the objects which are equal in the theory produce the same 
diagram. 

In addition to the two operations, composition and tensor, every generalised com¬ 
positional theory is equipped with certain primitive processes. The simplest process 
is the process which doesn’t do anything at all, simply returning unchanged the sys¬ 
tem given to it. We assume that for every system A such a null process, called the 
identity and written l a : A — > A, exists. The fact that it does nothing is expressed by 
the equations 

l«°/ = / = /° U 

for all processes / : A — > B. The identity process l a ■ A —> A is drawn as a wire 
without any box on it, while the identity for A®A' is simply the tensor product 
1 a ® 1 an i- e - t wo wires. 






1a® A' — A 

® A' 

= A 

a! 


Once again we see an equation absorbed into the notation: since the identity has no 
effect on a process, the length of the wires attached to a box makes no difference. 



In addition, for every pair of systems A and B there is a process <5a.b : A ®B —> 
B®A which exchanges the two systems. The class of theories we consider here are 
symmetric: swapping two systems twice has no effect, hence the equation 

&b,a ° <5a,b = 1 Am 

holds for all systems A and B Graphically, the swap is just the crossing of two wires: 



In fact, the swap should satisfy some further coherence equations, the details of 
which can be found in [17]. However, we can again make the graphical notation 
do the work by allowing wires to cross freely in the diagrams, and saying that only 
the connectivity of the wires matters, and not their configuration in the page. For 
example, the following diagrams are equal: 
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Note that we do not distinguish between wires crossing over and crossing under. 

A process may produce an output without having to consume an input first, or 
vice versa. Therefore we introduce a null system, or empty system, which we denote 
I. Hence a process that produces an A from nothing would be written p : I —>• A. Like 
the identity process, the null system obeys some equations: 

A®/=A=/®A and 1/®/= / = /<£> 1/, 

for all systems A and all processes /. As suggested by the preceding equations, I 
is represented as empty space in the diagram, and its identity process 1/ :/—)■/ is 
represented by the empty diagram. 

^ f ♦ 

A process of type s : I —> I is called a scalar, this name will be justified later. It 
is clear from the diagrammatic notation that given scalars s and s' we have sos — 
s®s'=s'®s = s'os; i.e. the scalars form a commutative monoid 3 . 

In the preceding text we have introduced various transformations of diagrams 
which, we claim, do not change anything. It is reasonable to ask: when are two dia¬ 
grams considered to be equal? We use a very intuitive notion here: Two diagrams 
are considered equal when, keeping the inputs and outputs fixed, one may be 
transformed to the other by purely topological transformations. In other words, 
if starting from one diagram we—by crossing or uncrossing wires, stretching wires, 
moving boxes along wires, translating boxes in the plane (while maintaining their 
connections), etc—arrive at the other, then they are equal. In particular, since scalars 



Fig. 1 Examples of topologically equivalent diagrams. 


3 This is true even for non-symmetric monoidal categories; see [7], 
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are not connected to the inputs or outputs of the diagram, they may be placed any¬ 
where in the diagram without altering its meaning. 

Example 1. The simplest non-trivial example is the theory with one primitive sys¬ 
tem, denoted u, and whose processes are generated by the identity and swap. We 
call this theory SymGrp. Since there is only one basic system, every other system 
is just an n-fold tensor power of n, hence the systems of the theory can be identified 
with the natural numbers. In this theory, a process p : n —> n is nothing more than 
a sequence of swaps; i.e. a permutation on the n-element set. Hence SymGrp is 
exactly the theory of the symmetric groups. 




Fig. 2 Example: the symmetric group S 3 presented as diagrams. 


Example 2 (Finite-dimensional Hilbert spaces). The theory called FHilb has as its 
systems all finite-dimensional complex Hilbert spaces. The processes of this theory 
are all linear maps / : A —> B. The sequential composition of processes is the usual 
composition of linear maps, and the tensor is the usual Kronecker product of vector 
spaces and maps. The identity process is the identity map, the swap is the evident 
permutation map, and the null system is the base field, C. Since a linear map C —> C 
is totally determined by its value at 1, we see that the scalars of FHilb are nothing 
more than the complex numbers themselves. 

We write FHilb/; to denote the subtheory FHilb restricted to Hilbert spaces of 
dimension D" and linear maps between them, for some fixed D. For convenience, 
we refer to FHilbo as Qubit. Notice that the systems of Qubit are all tensor powers 
of C 2 , and its processes include all quantum circuits, state preparations, and post- 
selected measurements, justifying the name. 

Remark 1. Note that we must specify what the tensor product is to specify what 
the theory is. For example, another equally valid theory is the collection of finite 
dimensional Hilbert spaces and linear maps, but with the direct sum as the tensor. 
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This is again a general compositional theory, although since it lacks certain other 
features we will require later, it will play no role in this presentation. 

Example 3 (Sets and Relations). An example with very different flavour, but most 
of the same structure is FRel. The systems of FRel are all finite sets (considered up 
to isomorphism 4 ), and the processes r : A —x Y are relations between A and Y, that 
is subsets of A xf. The composition of relations is given by 

sor= {( x,z ) | 3y s.t. ( x,y ) G r and ( y,z ) G s}. 

The identity process is the diagonal relation, 

l x = {(x,x) | VGA}. 

The tensor product in FRel is the cartesian product A®f = Axf, which takes the 
form 

r®r' = (y,y’)) \ (x,y) G r and (x?,y') G r’} 

on processes. The null system is the singleton set {*}, for which we have {*} x A = 
A for all sets A. There are exactly two relations from {*} to itself, namely the total 
relation and the empty relation. Hence, the scalars of FRel are the Boolean monoid, 
i.e. Zi with the usual multiplication. 

An important subtheory of FRel is FSet, obtained by restricting the to relations 
which are functions: that is, relations r : X — x Y where each x is related to exactly 
one v. Just as in the case of FHilb, we can consider restrictions of FRel to systems 
generated by a set of size D , which we call FRel/). For example, FReF contains all 
the Boolean functions. The intersection of FReF and FSet consists of precisely the 
Boolean functions; this theory we denote Bool. Many other interesting theories are 
subtheories of FRel; we’ll meet some more later. 

Since generalised compositional theories all share certain basic structure, it is 
natural to consider maps between them. Given two such theories C and D, a map 
F : C —x D consists of an assignment of each system A in C to a system FA in D, 
and an assignment of each process / : A — x B in C to a process F f : FA —x FB in D, 
obeying the following equations: 

F(A®B)=FA®FB FI = I 

F{g°f)=FgoFf F(f®g)=Ff®Fg 

F\a = 1 FA FOa.B = OFA.FB 

In the mathematics literature, such a map is called a strict symmetric monoidal func¬ 
tor; again, see Mac Lane [17] for the details. The important point to note is that the 
mapping F sends wires to wires. Therefore, to specify such a mapping it is enough 
to specify the image of the boxes in a diagram, ensuring that composition and tensor 
are respected. 

4 Since we identify sets of the same cardinality, we can equivalently say that the systems of FRel 
are just the natural numbers. 
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Example 4. We can define a map Rp '■ SymGrp FHilb by setting Rp(u) = C D 
and then everything else is defined by the requirement that Rp is a strict symmetric 
monoidal functor. Thus we have a D" dimensional representation of the symmetric 
group S n for every D. 

In fact, this construction applies equally well to any generalised compositional 
theory C: all that is required is an assignment of the unique primitive system u to 
some system of C. Therefore every generalised compositional theory contains all 
the symmetric groups. 

Given a mapping between theories it is easy to calculate the image of a given 
diagram. One must recursively partition the diagram into tensors and compositions 
of smaller diagrams until each partition contains exactly one element—that is, ei¬ 
ther a single wire, a crossing of wires, or a box. The interchange law (Equation 1) 
guarantees that the result does not depend on the partition chosen. 



We may now state: 

Theorem 1 (Fundamental Theorem of Diagrams). Given any two generalised 
compositional theories C and D, and a map F : C —> D, for any two diagrams d 
and d' in C, if d = d' as diagrams then Fd = Fd' in D. 

This theorem has many variations, and we refer the reader to Selinger’s survey arti¬ 
cle [19] for the full details. 

Remark 2. In the diagrams to come, we will often use horizontal separation to indi¬ 
cate separation in space and vertical separation to indicate separation in time. For 
example. 



depicts the creation of two systems by the process <J>, which then become spatially 
separated over some time and are acted upon by processes / and g respectively. 
Since, as we already know, topologically equivalent diagrams are equal, these sepa¬ 
rations have no formal status and are purely illustrative. 
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1.2 Rewrites and Models 

Since we wish to generalise over many concrete mathematical structures, we are 
particularly interested in theories which can be specified axiomatically. That is, to 
specify the theory we state (i) the list of basic systems—typically we’ll only have 
one basic system, the rest being generated by the tensor product—and (ii) the ba¬ 
sic processes. The processes of the theory are then all the diagrams which can be 
constructed from these processes and nothing else. 

Example 5 (Boolean Circuits). A simple example of a compositional theory is 
BoolCirc, the theory of boolean circuits. This theory has only one basic system, 
the bit b , and the basic processes are the logic gates: 

I Y 

A: b<S>b—> b V :b®b—> b —>:b-^b FAN :/?—>■£> ® Z? 

A process in this theory is a circuit for computing some boolean function, built up 
from these basic gates. 




Fig. 3 A Boolean circuit to compute (xA-<y) V -i(yAz). 


It is tempting to assume that BoolCirc is related to the theory of Boolean func¬ 
tions, and we can make this precise by specifying a mapping B : BoolCirc -A Bool. 
We assign B(b) = {0,1} and define B on the basic processes as follows: 
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'00 i^O 

1 

'00^0 

^3 

II 

01 ^ o 

<: 

II 

0 

01 ^ 1 

10 ^ 0 

10 H> 1 


11 ^ 1 


11 H> 1 

V 

B {~) =n: <| 

'0^1 

1 h> 0 

fi(FAN) = S : < 

f0 ^ 00 

[ 1 H> 11 


The mapping B assigns to each diagram the boolean function normally associated 
with it. However this is not the only possibility. Consider the following mapping, 
P : BoolCirc - Bool. Once again P(b) = {0,1}, but now we have the following 
assignment of processes: 


P( A) = a : 


PH = i ■ 


roo^o 


'00h^0 

1 01 ^ 0 

^0 

<: 

II 

^3 

01 h> 1 

| 10^0 

10 1 

[ 11 1 


11 h> 0 

r 0 h> 0 
[1^1 

P(FAN) = 8 : <j 

f 0 ^ 00 

[ 1 H> 11 


The mapping P assigns to each d : b" —> b in BoolCirc an n-variable polynomial 
over the ring Z 2 . (More generally a circuit with multiple outputs produces a list of 
polynomials, one for each output.) 

In fact, as the example of P suggests, the diagrams of BoolCirc admit an inter¬ 
pretation in any setting with two binary operations and one unary operation. This is 
not entirely satisfactory. In order to capture more than the bare syntax of any given 
theory we need to impose some additional equations on the class of diagrams. We 
do this via rewrite rules. 

A rewrite rule consists of a pair of diagrams of the same type, for example d: A 
B and d': A —» B. If this rule is called r then we write r: d =>■ d', or diagrammatically 



Whenever d occurs as a subdiagram of a larger diagram e then we can replace d 
with d' in <?, written e[d] e[d'], or in diagrams: 
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Rewrite rules allow us to define a notion of equality in addition to the basic equality 

Si 

of diagrams. Given a collection of rewrite rules Si we write d => d if there is some 
rewrite sequence in Si taking d to d . Evidently => is a transitive relation; let = be its 
symmetric, reflexive closure. Then we say that two processes are equal according 
to Si if their corresponding diagrams satisfy d = d . Typically we’ll exhibit this 
equivalence as a sequence of rewrites. 

Example 6 (Boolean circuits). Consider the following two rewrite rules for BoolCirc, 
expressing respectively the distributivity of AND over OR, and (one half of) De 
Morgan’s law. 



Now we can show that a certain Boolean circuit can be transformed into its disjunc¬ 
tive normal form: 


A A r ^ 



Given a theory C, a set of rewrite rules Si , and a mapping F : C —► I ), we can ask 
the following question: if d = d' in C, is it the case that Fd = Fd' in D? 

This property is called soundness. A sound mapping F : C —> D is called an 
interpretation of C in D, and the image of C in D is called a model. In the example 
above, the mapping B is sound, hence it provides an interpretation of BoolCirc (and 
Si) in Bool; on the other hand P does not, due to the failure of De Morgan’s law. 
Generally speaking we will always work with a given set of rewrite rules and a given 
interpretation map, so we will usually say “the D interpretation of C”, although in 
principle there could be many. 

Remark 3. The converse property to soundness, Fd = Fd' implies d = d', is called 
completeness. An interpretation which is both sound and complete provides an iso¬ 
morphism between the formally presented theory and its model. While checking 



GCTs and Diagrammatic Reasoning 


13 


soundness is straightforward, showing completeness is often much more difficult 5 . 
On the other hand, not having completeness means there are multiple models of a 
given theory, and the study of the differences between such models is often infor¬ 
mative. 

Before moving on, we’ll introduce an important example, and its standard model. 

Example 7 (Quantum Circuits). Similar to the example of Boolean circuits, we can 
also view (post-selected) quantum circuits as generalised compositional theory, 
called QuCirc. Again we have a single basic system, the qubit Q, and the basic 
processes are a collection of unitary gates, state preparations, and projections from 
which we construct the other quantum circuits. 



|0 ):/-H2 |1 (0|: G -> / <1| :Q-H 



-0 


Za '■ Q Q Xp • Q Q 7\X : Q(g> Q —> Q(gi Q 

From these basic elements we can write down any quantum circuit. We now define 
the standard interpretation of QuCirc into Qubit. 

IG]=C 2 


[T_1_ 


_i_ 





|o) 

= |o> 

|i) 

= |i> 

(0| 

T 

= <o| 

(It 

T I 


<i| 



l n 

A 


/I 0 0 0\ 
0 100 

T ^ 

7 


000 1 
^0 0 10/ 


5 To show completeness for a rewrite theory it is typically necessary, but rarely sufficient, to check 
that the rewrite rules are confluent ; that is, whenever two rewrites simultaneously apply to a given 
diagram, then the choice between then (eventually) does not matter. Since this property must hold 
for every diagram and every pair of rewrites, even a simple rewrite system can produce an ex¬ 
tremely large number of cases, necessitating a computer-assisted proof. For example see the work 
of Lafont on Boolean circuits [20]. 
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Thanks to the well-known universality result [21] this interpretation demonstrates 
that QuCirc can represent all unitary maps between qubits. In fact, since we have 
the projections (0|, (1|, all linear maps can be represented. Note, however, that al¬ 
though all quantum circuits can be represented, without a set of rewrite rules QuCirc 
cannot express any non-trivial equalities between them. We could propose various 
sound equations here, but there is no known collection of rewrite rules which makes 
QuCirc complete with respect to this interpretation into Qubit. If such a set of 
rewrites did exist, it would constitute provide a presentation of the unitary group by 
generators and relations. 


1.3 The Dagger 

Now we introduce the dagger. This is simply an operation on the processes of a 
theory, sending every process / : A —> B to another process /' : B —» A. We call /' 
the adjoint of /. In the graphical calculus, we represent the dagger by a flip in the 
horizontal axis: 



Note that we have made the box asymmetric to make this flipping evident. For more 
general diagrams, the dagger flips a diagram upside down, preserving all the internal 
structure. Taking this claim at face value, we can derive the key properties of the 
dagger: 
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°A,B ~ °BA 

The dagger allows two important concepts to be defined. 

Definition 1. A process / : A —► B is called unitary if /o = 1 B and of = 1 A . 
A process is called self-adjoint when f = f. 

Example 8 (Finite-dimensional Hilbert spaces). The theory FHilb admits a dagger: 
it is the usual adjoint of a linear map. In this theory, the abstract definitions of uni- 
tarity and self-adjointness coincide with the usual one. 

Example 9. In the theory FRel, the dagger of a relation r : X —> Y is defined by the 
converse relation, i.e. 

r T = {(y,x) | (x,y) G r} 

Here, unitary processes are exactly those relations which encode permutations. A 
relation is self-adjoint whenever it is symmetric. Hence the self-adjoint unitaries in 
FRel are exactly the permutations of order 2. 

We extend the definition of mapping to demand that it also preserves the dagger. 
That is, given two theories with dagger, we require that a map F : C — ► D satisfies 

Fif) = ( Ff ) f 

Example 10 (Quantum Circuits). We define a dagger on QuCirc as follows: 
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It’s now easy to check that the interpretation map introduced earlier, [•] : QuCirc —► 
Qubit preserves the dagger as required. 


Remark 4. The theory of Boolean circuits, BoolCirc, does not admit a dagger. 
However, we could formally add new basic processes corresponding to the ad- 
joints of the basic processes of BoolCirc and thus define a new theory, BoolCirc'. 
Since the converse of a function is not in general a function, the interpretation 
B : BoolCirc —> Bool no longer makes sense. Instead we must interpret BoolCirc 1 
over FRel 2 , that is as Boolean relations rather than functions. In this case B again 
defines a valid interpretation BoolCirc 1 —► FRel?. The resulting theory is a model 
of non-deterministic computation. 


In any theory, a process of type p : I —> A is called a point, or sometimes a state, 
of A. Dually, a process of type e : A —► I is called a co-point, or sometimes an effect 
on A. For example, in FHilb the points y/ : I —> A are in one-to-one correspondence 
with the vectors of A, while in FRel a point s : I — > X is precisely a subset of X. 

In a theory with a dagger the set of points is isomorphic to the set of copoints 
(or in other language, for every state there is a corresponding effect and vice versa). 
This allows us to define another important concept. 

Definition 2. Given two points y/, (/) :I A we define their inner product as @ : o y/. 
Dually, the outer product is defined as (j) oy/'. 


As one may expect, the inner product is always a scalar. The diagrammatic lan¬ 
guage automatically allows the same tricks—and more— as Dirac notation does in 
Hilbert spaces. Indeed one can view the diagrammatic language as a 2-dimensional 
generalisation of Dirac notation. 

Example 11 (Finite-dimensional Hilbert spaces). In FHilb the inner product defined 
by the dagger, is exactly the usual inner product (<j>\y/). 

Example 12 (Sets and Relations). In FRel the inner product r 1 os is 0 if the r and s 
are disjoint as subsets, and 1 otherwise. 
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2 Pure state quantum mechanics 

2.1 The elements of an operational theory 

It is remarkable that the the basic language of quantum mechanics—states, effects, 
unitarity, self-adjointness, inner products, tensor products—can all be defined in the 
abstract setting of generalised compositional theories. We now have enough material 
to describe a formal operational framework for pure state quantum mechanics in 
purely diagrammatic terms. 

• A preparation is any process which produces a state; that is to say it is process 
of type p ; / —> A. 



Preparations are not restricted to producing single systems; a preparation process 
of type I —> A\ <8» • • • A„ is called multipartite. Of course, multipartite preparations 
need not be separable. 



When interpreted in FHilb each preparation process yields a ray in some Hilbert 
space, which, ignoring global phase, we may identify with a specific quantum 
state. It may happen, depending on the equations of the formal theory, that dif¬ 
ferent preparation processes produce the same state. 

• A transformation is any process which acts on states and produces new states, 
and which is unitary; 

JL 

U 

Once again, transformations may act on one or many systems at the same time. 



U 


A i | A i | 

• Measurements are processes which accept quantum inputs and produce classical 
information about the state which was input. Since, for now, our theory only has 
pme states, we will work with non-degenerate post-selected measurements 6 ; i.e. 
we know that a definite outcome has occurred, and that outcome corresponds to 
a definite quantum state. Therefore, measurements are one-dimensional effects, 
represented as co-points: 


6 In other words, rank 1 projectors. 
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A i 

The classical information is implicit in the choice of copoint, and hence not repre¬ 
sented. Since copoints do not have quantum outputs, these processes correspond 
to demolition measurements, where the original system is consumed by the mea¬ 
surement process. However, by combining an effect with the corresponding state 
preparation we can also represent non-demolition measurements: 






To properly represent the non-determinism of quantum measurements we need to 
consider mixed states; this is dealt with in Section 4. More general measurements 
can be represented within the theory, however they will not be described here. 

This basic recipe—preparations, transformations, and measurements—allows 
any experimental setup to be described in terms of the processes which realise it. 
More precisely, since we use post-selected measurements, the diagram really repre¬ 
sents a run of the experiment where a certain outcome occurred. We call an exper¬ 
iment closed when it has no external inputs or outputs. Any closed experiment is 
necessarily described by a process of type x : I —> /; that is, a scalar. This scalar is 
the abstract counterpart to the probability amplitude for performing the process and 
observing the specified result. Indeed, when such a diagram is interpreted in FHilb, 
the result is exactly the probability amplitude. 

Example 13. The theory QuCirc has the structure described above, and we can use 
it to define a simple experiment. For example, the diagram below corresponds to 
preparing a qubit in the |0) state, applying a unitary gate to it, and, upon measuring 
in the computational basis, finding that the qubit is in the state 11). 




-W/2 

5 

3 


Using the interpretation map [•] : QuCirc —> Qubit we can calculate the amplitude 
for this experimental result. 
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To summarise the elements of the framework, a formal generalised compositional 
theory consists of: 

• A collection of basic systems and processes, corresponding to the available “lab 
equipment”. 

• The collection of all diagrams constructed from the basic processes, correspond¬ 
ing to every possible experiment that could be built from the given equipment. 
We consider diagrams modulo topological equivalence: equivalent diagrams cor¬ 
respond to the same experiment. 

• A (possibly empty) collection of axioms, presented as rewrite rules over dia¬ 
grams, which specify behavioural equivalence of processes. These rules tell us 
when a piece of the experimental setup can safely be replaced by another without 
changing the result of the experiment. 

• Finally, given the above, we’ll usually consider (sound) interpretation maps of 
the formal theory into some concrete mathematical structure, such as Hilbert 
spaces. 

So far we have been operating at an extremely high level of generality. To focus our 
attention on quantum systems we will now gradually introduce more structure to our 
theories. We identify certain structural features of the Hilbert space presentation of 
quantum mechanics, and provide an abstract realisation of those features in terms of 
basic processes and equations, whose behaviour reproduces various quantum phe¬ 
nomena in the abstract setting of generalised compositional theories. 

The rest of this section will layout which basic processes and equations we will 
need to realise. As we do so, we’ll say goodbye to some of the models introduced 
earlier, but the two most important ones, FHilb and FRel, will still be applicable. 


2.2 Duals 

The next piece of structure that will be required is the existence of duals 1 . 
Definition 3. A system A has a dual if there exists a system A* and processes 
eA : l —> A* (g)A and dA'-A®A*^l 

such that we have the following equations: 

7 For the experts in category theory, this additional structure can be summed up by saying we 
operate in a dagger-compact category, rather than just a symmetric monoidal category. 
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Wa® U)°(U®eA) = 1 a (1a*0c/a)°(<?a0 1a*) = 1a* 

Since this definition is rather hard to parse we will immediately move to its diagram¬ 
matic form. We indicate the dual system A* by a wire labelled by A but directed in 
the opposite direction. The maps d A and e A are represented by wires with half turns, 
henceforth “caps” and “cups”. The equations above then take the form of “straight¬ 
ening wires”: 



In general a system might have more than one dual, but they are all guaranteed to 
be isomorphic. We’ll assume that every system has a given dual, and in particular 
(A0B)* = B* 0A*, in which case d A ®B and e A ®B take the form of nested caps and 
cups. Furthermore, we’ll assume that the double dual A** = A. These simplifications 
automatically hold in any theory presented diagrammatically; taking them as the 
general case saves a lot of bureaucracy. 

Example 14 (Finite dimensional Hilbert spaces). Let A be a Hilbert space of dimen¬ 
sion d, then A* is the usual dual space; that is, the space of linear functionals from 
A to the complex numbers. Supposing that {|a,-)} is a basis for the space A, then the 
cup and cap are given by the linear maps 


e A : 1 £(a,j 0 |a,-) , d A : £ |a,-) 0 (a,-| 1 . 

i i 

However since we are in a finite-dimensional setting we could also choose A* = A; 
specialising to the case of qubits, we can now view the cup and cap as the preparation 
and projection onto a Bell state: 

e Q = l°0) + |11) dQ = (00| + (11| 

Recall the quantum teleportations protocol: Alice has some unknown state that she 
wishes to send to Bob, but they do not share a quantum channel. However they have 
a classical channel, and have previously shared a Bell pair. In order to send her qubit 
to Bob, Alice measures her two qubits in the Bell basis, and transmits the result to 
Bob. Now Bob simply applies some unitary map (depending on Alice’s outcome) 
to his half of the Bell pair to recover the qubit that Alice wanted to send. Since, 
for the moment, we are operating in a post-selected setting, we’ll assume that Alice 
observes the outcome corresponding to the state <P + = (|00) + |ll))/\/2) at her 
measurement. In this case Bob need do nothing to his qubit. The whole set up is 
shown below: 
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Alice 



Knowing that the projection onto <J> + is just the effect (Iq, we can rewrite the proto¬ 
col as shown and demonstrate the protocol purely diagrammatically: 





Example 15 (Sets and Relations). In FRel, the dual of a set X is just the same set X 
again. The cup is given by the “name” of the identity: 


ex = {(*,(*,*)) \xGX} 


while the cap, dx is just the converse of ex- 


Using caps and cups, we can turn any process / : A —> B into a process on the 
dual objects going in the opposite direction: /* : B* —> A*. 



This is sometimes called the transpose of /, but this terminology can be misleading. 
In FHilb, f* is the map that takes a linear form (q € B* to (iq \f £ A*. We refer to 
this map simply as the upper-star of /. Clearly, we have /** = /. 

It is also required that the dagger and the duals interact nicely. More precisely we 
have the equations: 



t 




In any theory with both a dagger and duals, we can define a third operation, the 
lower-star of / as /* := (/^)* = (/*)^. Again this is involutive, i.e. /** = /. We’ll 
return to the uses of the upper and lower stars in Section 4. 

Finally, the cup and cap can be used to define a trace in purely diagrammatic 
terms: 
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Checking the Hilbert space interpretation, it is easy to see that this coincides with 
the usual definition. 


Tr([/7) 


(E <“l) ° (u ® /) ° (E l//»=E ( i ’l /10 

i J i 



In the diagrammatic form it is trivial to prove that trace is invariant under cyclic 
permutation: 



The partial trace can be defined analogously. 



( . 

B 

\ 

A 

1 8 

Tr£ 

[ 

u 

= 

[ 

u 



B 

I 

A 

l B 


By adding duals we have enlarged the class of possible diagrams, since wires 
may now loop back from inputs to outputs and vice versa, but the basic principle of 
diagram equality does not change: Two diagrams are considered equal if one can 
be smoothly transformed to another, by bending, stretching, or crossing wires, 
and moving boxes around. With this in mind we can update the key theorem. 

Theorem 2 (Fundamental Theorem of Diagrams with Daggers and Duals). 

Given any two generalised compositional theories C and D with daggers and duals, 
and a map F : C —> D, for any two diagrams d and d' in C, if d = d' as diagrams 
then Fd = Fd' in D. 


Once again, the full details are found in [19]. 

Remark 5. We need not demand any additional conditions on the class of mappings 
to guarantee the preservation of duals; since they are defined in terms of processes, 
the structure is automatically preserved. 
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2.3 Observable Structures 

An observable yields classical data from a physical system [22]. The key distinction 
between classical and quantum data is that classical data may be freely copied and 
deleted, while this is impossible for quantum data, due to the no-cloning [23, 24] 
and no-deleting [25] theorems. 

In quantum mechanics, an observable is represented by a self-adjoint operator. 
This (non-degenerate) operator encodes certain classical data as its orthonormal ba¬ 
sis of eigenstates, the possible outcomes of the corresponding measurement. Note 
that if a quantum state is known to be a member of a given orthonormal basis, such 
as the eigenbasis {|«,-)} of some observable, then it can be copied and deleted via 
the maps 

8 : | at) i->- |a,-) ® |a,-) and e : \ai) >->• 1. 

Hence we can view the classical content of a quantum measurement as the possibil¬ 
ity to copy and delete its set of outcomes. We will axiomatise quantum observables 
by describing the copying and deleting operations as algebraic structures inside a 
general compositional theory. The relevant structure is called a f -special commuta¬ 
tive Frobenius algebra , and we will now build up its definition one piece at a time. 

Definition 4. A commutative monoid in C is a triple (X,ju,rj), where /i and rj are 
maps 

p : X (g)X —tX r/ :I —tX 

which we write graphically as ju = , /] = A. These operations satisfy the follow¬ 

ing equations: 



Remark 6. The process p can be understood as a multiplication for systems of type 
A; the first and last equations assert that this operation is associative and commuta¬ 
tive respectively. The process t] is the unit for this multiplication: the second equa¬ 
tion asserts that multiplication by the unit is simply the identity. 

The dual to a monoid is a comonoid. 

Definition 5. A comonoid in a theory C consists of a triple (X,S,e) where 8 and e 
are processes 

S:X^X®X £ : A —> / 

satisfying the equations of Definition 4 but in reverse, viz: 
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A comonoid is cocommutative if it satisfies: 


The processes 8 and e are called the comultiplication and counit respectively. 

Example 16. We have already met the basic example of a comonoid: in FHilb, for 
any orthonormal basis {x, },- of a space A we obtain a comonoid via ‘copying’ and 
‘erasing’ processes mentioned above: 


8 : x\ i t Xi ®Xi £ : Xj H > 1 


Remark 7. Thanks to the dagger, if (A, 8, e) is a comonoid then (A, 8 1 , £ ' ) is auto¬ 
matically a monoid, and vice versa. 

Generally speaking, a process is called a homomorphism if it preserves some 
algebraic structure. In the context of GCTs, such preservation is usually expressed 
by a process commuting with another which reifies that structure. For example: 

Definition 6. Given two comonoids (A, 8, e) and (A', 8 1 , e'), a comonoid homomor¬ 
phism is a process /: A —> A' such that 

8’ o / = (/ ® /) o 5 and e' of = £ . 



Monoid homomorphisms are defined similarly. 

Remark 8. The definition above is the most general, but we will frequently en¬ 
counter cases where / : A —> A is homomorphism between two comonoids defined 
on the same object, or from a single comonoid to itself. 


The structures of greatest interest for this paper are algebras containing both 
monoids and comonoids. 


Definition 7. A commutative Frobenius algebra is a 5-tuple (A, 5, £. f .1 . t] ) where 

1. ( X,S,£ ) is a cocommutative comonoid; 

2. (A, jtl, rj) is a commutative monoid; and, 

3. 8 and t/ satisfy the following equations: 
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Finally, we can define: 

Definition 8. A f -special Frobenius algebra (f-SCFA) is a commutative Frobenius 
algebra 


0o — (Mo • ^ ®X y X 7 r) 0 : l v X , 
S 0 :X-^X®X, So :X->/) 


such that S 0 = (jU 0 )9 £o = (Vo)^ and 

The preceding definitions may seem rather opaque, and not fully justified by the 
intuition that a quantum observable is somehow encoded by the maps which copy 
and delete its eigenstates. However complex it may appear (and we shall shortly 
simplify it), the importance of Definition 8 rests on the fact [26] that in FHilb ev¬ 
ery f-SCFA arises from a comonoid defined by copying an orthonormal basis as 
described above. Since orthonormal bases define non-degenerate quantum observ¬ 
ables, f-SCFAs are also called observable structures. 

Concretely, given an orthonormal basis {|/)},■ then 8 0 :: |i) i—>■ \ii) defines an ob¬ 
servable, and all observables are of this form for some orthonormal basis. The re¬ 
sulting intuition is that S 0 is an operation that ‘copies’ basis vectors, and that e 0 
‘erases’ them [22]. We will use the symbolic representation (/i G . t] 0 . 8 0 . s D ) and the 
pictorial one (^, 6 ? , 9) interchangeably. 

Example 17 (Sets and Relations). Perhaps surprisingly, FRel also has many distinct 
observable structures, which have been classified by Pavlovic [27]. Even on the two 
element set there are two, namely 

S 0 : i {((,*)} 

0 ^ {( 0 , 0 ),( 1 , 1 )} 

1 ^ {( 0 , 1 ),( 1 , 0 )} 

In fact, this pair is strongly complementary in the sense to be explained in Section 3. 

Manipulating observable structures in the graphical language is extremely con¬ 
venient since they obey a remarkable normal form law. Let S„ : X —> X® n be defined 
by the recursion 

S 0 :=e S n+ i := (S n (g)l A )oS 

and define p m analogously. Now we have the following important theorem: 

Theorem 3. Given an SCFA (X ,5 ,e,/x, i]) let f : X® m X® ,! be a map constructed 
from S,e,p and rj whose graphical form is connected. Then f = S n o p in . 

Proposition 1. Given an observable structure f/ 0 on X, let (o)" f denote the ‘( n,m)- 
legged spider’: 
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then any morphism X m —► X®' n built from fi 0 , r) 0 ,5 0 and e 0 via f-SMC struc¬ 
ture which has a connected graph is equal to the (o)”'- Hence, spiders compose as 
follows: 

M 

Thanks to the spider rule (2), every observable structure on X makes X dual to 
itself (in the sense of Definition 3), via the cup and cap: 




The upper-star with respect to this cup and cap corresponds in FHilb to transposition 
in the given basis. For that reason, we call this the o-transpose /"- jT . The lower star 
corresponds to complex conjugation in the basis of 6 0 , so we call it the o-conjugate 

f® ■= (/ OT ) t - 

Recall that a process k : I —> X is called a point ofX. In FHilb the points of X are 
simply vectors in the Hilbert space X. The abstract analogue of the eigenvectors of 
an observable in FHilb are the classical points of an observable structure. 

Definition 9. A classical point for an observable structure is a state that is copied 
by the comultiplication and deleted by the counit: 



We will depict classical points as triangles of the same colour as their observable 
structure. 

Remark 9. Another way to say the same thing, is to define classical points as 
comonoid homomorphisms from the trivial comonoid (/, 1/, 1/) to (X,S,e). 

In quantum computing, it is common to think of elements of a product basis as 
strings of some kind. E.g. for qubits: 

1010011) o |0)®|1)®|0>®|0)®|1>®|1) 

Such product bases are precisely the classical points of products of observable struc¬ 
tures. Given an observable structure f7 0 on X, and another f/ 0 on Y, we form a new 
observable structure on X 0 Y by taking their product: 
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Evidently any pair of classical points for the constituent observable structures will 
be copied. 



Generalising, the classical points of any u-ary product of observable structures are 
nothing more than lists of classical points, one for each factor. 

Working concretely in Hilbert space, one can use the linear structure to give 
another set of equations for observable structures. Consider some basis vector (), 
then the map \ii)(i\ has the diagrammatic form: 



But notice that the sum £,• \ ii)(i\ is nothing more than the the map 8 : ;) -* \ii). A 
similar statement can be made for the counit e. Hence given the complete set of 
classical points for an observable structure we have the following equations: 



A - 5 ^7 ? - A f 

Indeed these can be generalised to arbitrary spiders: 



Note that generalised compositional theories do not necessarily admit addition of di¬ 
agrams: we introduce these equations as way of generalising from concepts defined 
in Hilbert space to the abstract setting where there need not be any linear structure. 

Linear maps have the property that if two maps are equal on every element of a 
basis, the maps themselves are equal. In analogy to this we define the following: 

Definition 10. Let f)' 0 be an observable structure on X, with classical points 
we say that 0 O has enough classical points if, for every system Y , and every pair of 
processes /,g : X —> L, we have 
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(Vi: f oki = goki) => f = g . 

This property does not necessarily hold in an arbitrary GCT (although obviously it 
does in FHilb) however when it does hold certain statements can be made stronger. 
For example, many implications described in the subsequent sections are equiva¬ 
lences if the underlying object has enough classical points. 


2.4 Phase Group for an Observable Structure 

Let t f! and (!) be two points of X. Given an observable structure f)' 0 on X, applying 
the multiplication p 0 to if/ and 0 yields another point of X: 

V+o 0 := M° (Y®#) 



Since fj, 0 is commutative and associative, and it has a unit point (namely rj 0 ). the 
operation + c gives the points of X the structure of a commutative monoid. 

If we restrict to those points i/r: / —► X which satisfy 


H% = J lo 



we obtain an abelian group <J> 0 , called the phase group of [28, 29]. The elements 
of this group are called phases. Note the phase group is non-empty, since the unit 
T]o satisfies the required equation. We let — a := a® for phases, and represent these 
points as circles with one output, labelled by a phase. 


Example 18. In FHilb, let 6' 0 be defined by some orthonormal basis {JOne can 
verify by direct calculation that a vector | \jf) lies in the phase group <J> 0 if and only if 
we have | (i\\jf) | 2 = 1/D. for all i, where D is the dimension of the underlying space. 
Such vectors are called unbiased for the basis {|i)The multiplication is then the 
convolution (pointwise) product. 
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Concretely, for a qubit observable given by fi 0 = |0)(00| + 11}(111, the phases 
are the unbiased states, which are all of the form: 



with the multiplication: 



We therefore see that the phase group for a qubit observable is the circle group. It 
is an easy exercise to check that for a Z)-dimensional Hilbert space the phase group 
for any observable is isomorphic to the (D — l)-dimensional torus. 


The name ‘phase group’ comes from fact that the elements of the <J> 0 correspond 
to unitary maps which preserve the basis defining & 0 • We can map any point (//:/—> 
X onto an operation on X via the left action, A 0 (y/) = JU 0 o (ys® lx), or in pictures: 


A 0 : 



i—^ 



Then we have the following facts: 

Proposition 2. Let (fiy/ £ <P Q ; then 

1. A o (0) is unitary; 

2. A o (0) oA 0 (y/) =A o (0 + y/) =A o (y/-)oAo(0) 

3. Ifk is a classical point for G 0 then A o (0) o k = k(g>s for some scalar s. 
Proof 1. We show that A 0 (i//)^ °A 0 (V^) = 1-' 



The first equation is the spider rule while the second is the definition of y /®. The 
case A 0 (y) ° (Ao( y)Y — 1 similar. 

2. This follows immediately from the associativity and commutativity of p 0 : 
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3. This follow from the definition of classical points. 



The image A 0 ( < P 0 ) is therefore an abelian subgroup of the unitaries on X, which 
is isomorphic to <J> 0 . We refer to these as phase maps. If we reinterpret the third part 
of the preceding proposition in terms of linear algebra, we see that every classical 
point of 0 O is an eigenvector of every phase map i n Ao (<P 0 ) . This in turn “explains” 
why they commute with each other. 

Example 19. Let 0 O be defined by jU 0 = |0}(00| +11)(111 as above. Now for a <G <P 0 
we have 



Hence the phase group in Hilbert spaces is exactly the group of phase shifts relative 
to the given basis. 


Generalising from the preceding discussion, we can now introduce ‘spiders dec¬ 
orated with phases’: 



which compose as follows: 



In the following sections we will refer to this generalised composition rule for 
phased spiders as the spider law. 
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2.5 Two toy models 

In this section we’ll introduce two “toy” models of quantum mechanics. The first is 
the restriction of quantum mechanics to stabilizer states; this theory we call Stab. 
The second is the toy model due to Spekkens [30], which we refer to as Spek. 
While the first of these is indeed a true subtheory of quantum mechanics, Spek is a 
local hidden variable model. By casting both of these in the language of generalised 
compositional theories we can see that the difference between is in fact very slight. 

Before discussing these concrete models, we’ll introduce a formal precursor the¬ 
ory. Let Toy be the general compositional theory built from the formal generators: 

• one basic system, which we denote T ; 

• six points zo,Zi,xo,xi,yo,yi : / —> T and their corresponding copoints; 

• 24 unitary maps T —> 7’ which form a group isomorphic to the symmetric group 

S 4 ; 

• one observable structure G a , whose classical points are zo and zi, and whose 
phase group comprises the remaining four points. 

Note that Toy is not fully specified: to do so we ought to say which group the phase 
group is, and how the corresponding unitaries embed into the endomorphisms of T. 
Since <J> 0 is a four-element group we have only two choices here: Z 4 , or Z 2 x Z 2 . 
As we will see this choice will make the difference between stabilizer quantum 
mechanics and the quantum-like local hidden variable theory. 

Let Stab be the sub theory of FHilb generated by the following elements: 

• One basic system C 2 , which we call Q. 

• Six points I —> Q: 

zo = |0> *> = 7f(l°> + |i» ^>H‘1i}) 

zi = |l> *1 = ^|(|0>-|1» » = ^<|°>-.-|l» 

• The group of unitaries generated by the matrices: 

z./ 2 =(i") x,„=jL= (_)-;) 

This group is known in the quantum computation literature as the Clifford group 
for one qubit; it is isomorphic to S 4 . The other key property of this group is that 
it acts as a permutation on the states defined above, so we cannot generate new 
states via unitaries. 

• An observable structure & 0 defined by the basis |0), 11). 

Evidently the classical points of 0 O are indeed zo and zi and the remaining points 
are unbiased for this basis, hence part of <J> 0 . One can check that 


A) Cvo) — Zjj/2 
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and which generates a four element cyclic subgroup, hence the phase group <J> 0 is 

^4. 

We now introduce Spekkens’ toy theory. The toy theory is a local hidden variable 
theory, based on epistemic restrictions. There is a single basic system, the toy bit, 
which can have one of four possible states. We formalise the state space simply as 
a four-element set. However, we now impose the epistemic restriction that any state 
preparation (and, dually, measurement) may only narrow down the state to two of 
the possible four. Hence the “states” of the toy theory are two-element subsets. Al¬ 
though Spekkens’ original presentation [30] was informal, the toy theory is ideally 
studied as sub theory of FRel. Following [31], let Spek be the sub theory of FRel 
generated by the following elements: 

• One basic system, the four element set 4 = {0,1,2,3}. 

• Six points: 


20 = {0,1} VO = {0,2} yo = {0,3} 

zt = {2,3} xi = {1,3} yt = {1,2} 


• The full group of permutations on 4; 

• An observable structure 0 O defined by 


{ 00 , 11 } ~ 0 
. { 01 , 10 } ~ 1 
1 {22,33} ~ 2 
{23,32} ~ 3 


770 :*~{ 0 , 2 } 


where we write the tensor as juxtaposition, i.e. 00 = (0,0). 

Once again we easily check that the classical points for f)' 0 are zo and z ,\, and the 
other four form the phase group <J> 0 ■ The phase group in this case is generated by 
the transpositions (0 1) and (2 3); hence <f> 0 — Z 2 x Z 2 . 

As should be evident by this point both Stab and Spek are realisations of the in¬ 
complete theory Toy. The only notable difference between them is the group struc¬ 
ture of <J> 0 . This highlights the importance of the phase group for understanding 
non-locality in generalised compositional theories. 


Remark 10. In the description above the group of unitaries was given a priori. This 
is not necessary. If we include a second observable structure 0 O , corresponding to 
the classical points xq and x\, the the union of the two phase groups <J> 0 and <J> 0 
yields all unitaries described above. These two observables are complementary in 
the sense described below. Hence these two theories are in a sense minimal. 
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3 Complementarity and Strong Complementarity 


In the Hilbert space presentation of quantum mechanics, two observables are com¬ 
plementary if their bases of eigenstates are mutually unbiased. That is, for any i,j. 


Vi V,: 


1 /D. In the graphical notation: 



1 

D 


A question posed by Coecke and Duncan [28, 29] was, “Can we represent com¬ 
plementarity purely in terms of interacting observable structures?” It turns out that 
complementarity is equivalent to a simple diagrammatic equation. First, we can 
move l/D in the above equation to the other side and express it as a circle, as 
the trace of the identity always equals D. Then, replace 1 on the RHS with “deleted 
points”. 



(6) 


As we saw in section 2.3, observable structures fix an isomorphism of a space with 
its dual space, via the transpose. While it is not true in general that \y/) G ' T = (y/\, 
the transpose does take classical points for a particular observable structure to their 
adjoints: 

|v,-) OT = (v/| and 


'y>° T = M 




Graphically: 


Exercise. Prove this. 

We can rewrite the left hand side of equation (6) using this fact. 


(7) 



= o 


= o 


The last equation follows by substituting the symbol 5 for its definition, viz: 


S) = 




( 8 ) 


(9) 


Unifying Equations (6) and (8) we have: 
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( 10 ) 


Since this equation holds for all classical points i and j, if we now appeal to the 
fact that FHilb has enough classical points (cf. Definition 10), we can conclude that 
identity holds without points: 


o 



i 

? 


do 


Remark 11. The above equation is (up to a scalar factor) one of the defining equa¬ 
tions a Hopf algebra, in which case the map S is called the antipode. For that reason, 
we refer to (11) as the Hopf law. As we will see in the next section, subject to some 
additional assumptions, pairs of complementarity observables do indeed form Hopf 
algebras with the antipode defined as in Equation (9). 

Notice if we assume Equation ('ll) we can derive Equation (6) without any addi¬ 
tional assumptions. In other words, if €> 0 and f/ Q satisfy the Hopf law their classical 
points are mutually unbiased. Thus, we take the Hopf law to be the defining equation 
for our abstract notion of complementarity. 

Definition 11. A pair (&o, &•) of observables on the same object X is complemen¬ 
tary iff: 



Since every observable in FHilb has enough classical points. Definition 11 is 
equivalent 8 to saying that observables are complementary if their eigenbases are 
mutually unbiased with respect to the other. (See [28] for more details). Hence, 
we reclaim the usual notion of quantum complementarity, and extend it to a more 
general setting. 

Definition 12. A pair ( f / Q , f/ 0 ) of observables on the same object X is coherent iff: 


Y=n sy= ii 


8 Indeed Equations (6) and (11) are equivalent in any theory wherever at least one of the observable 
structures has enough classical points. 
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In other words, the unit point rj 0 (6) modulo a scalar factor, a classical point for 
Co, and vice versa. 


: 




i 


We will assume that the scalar g is always cancellable. 

Example 20. Consider the two observables on the Hilbert space C 2 corresponding 
to the Z and X spins: 


.|o>-hoo> 

°-|i)^|n) 


»-H++> 


J?o 


. 10)^1 

•|i)^i 


V o 


.!+)->! 
• |-)->l 


Computing rj 0 we obtain: 


J?o = (£o) t 


1 ^(| 0 ) + | 1 )) 


V^|+) 


which is indeed proportional to a classical point for 8 0 ■ By a similar computation 
we obtain rj 0 = \/210), from which the value of their inner product § = \fl follows. 
The equations of definition 12 can easily be verified from here, demonstrating that 
C 0 and Co are coherent. 


Proposition 3. In FHilb if 0 Q and 0 o are self-adjoint operators corresponding to 
complementary observables, one can always choose a pair of coherent observable 
structures (C 0 ,C») whose classical points correspond to the eigenbases of 0 o and 
0 o . 

Proof. Let { |a,-} }" =1 and { | bj) }" =1 be orthonormal eigenbases for 0 o and O a re¬ 
spectively. Since the bases are mutually unbiased we have 

\bj\ = \cC\j \a \) -(- • • • CC r !j \af) 1 
vn 


where the 0 Cjj are scalars satisfying |a,y| = 1. Setting \a\) = an |a,-), we see that 
{\a'l)}j is also an orthonormal eigenbasis for 0 o , which is still mutually unbiased 
with respect to { \bj)}j. Now define: 

So : |a-) i ^ |a-) <g> | a',) 

So : | a!-f H v 1 


This choice yields T] 0 = (£o) 1 ' = L/I a /) = \fS\b\)- 
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In a similar fashion we can choose an eigenbasis \b\)\b' n ) for 0 o such that 
the resulting 8 0 and e 0 satisfy (e 0 )^ = yfn |oj). It is straightforward to verify that 
this can be done such that \b\) = \b\), ensuring the coherence of ( 3 0 and 6 0 . 

For this reason we will from now on assume that pairs of complementary observ¬ 
ables are always coherent. 


3.1 Strongly complementary observables 

Many familiar observables, when expressed in terms of algebras, turn out to have 
useful additional properties. These are called strongly complementary, before de¬ 
scribing them we will require some preliminary definitions. 

Definition 13. A (commutative) bialgebra on A is a 4-tuple (jU, rj, 8 , e) of maps, 

ju : X ®X X S:X->X®X 

tj : I -» A £ : X -> I , 

which we write graphically as 6> , 6)’ such that: 

• [X,ji, r/) is a (commutative) monoid; 

• (X,8,e) is (cocommutative) comonoid; and, 

• the following additional equations are satisfied: 



Remark 12. Note that equations (13) and (14) are very similar to the equations re¬ 
quired for the coherence of two observables, per Definition 12. The only difference 
there is that the scalar 0 is not assumed to be trivial. 

Definition 14. A (commutative) Hopf algebra on A is a (commutative) bialgebra on 
A, augmented with a map s : A —> A, called the antipode, which satisfies: 
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Again, note the similarity to Equation (11): the difference is only by a scalar factor. 

Definition 15. A pair ( f) Q . f)' a ) of observables on the same object A is strongly com¬ 
plementary iff they are coherent and: 



(16) 


To expand on this definition slightly, given a pair of strongly complementary 
observables, if we consider just the monoid part of one and the comonoid part of 
the other then the resulting structure is, up to a scalar factor, a bialgebra. Note 
that thanks to the up-down symmetry induced by the f it doesn’t matter which is 
the monoid and which the comonoid. For obvious reasons, the we say that a pair 
of strongly complementary observables forms a scaled bialgebra, and we refer to 
Equation (16) as the bialgebra law. Notice that we have not, as yet, established any 
connection between complementarity (Definition 11) and strong complementarity. 
The following theorem links the two definitions. 

Theorem 4. Let & 0 and 0% be strongly complementarity observables; then they are 
complementary. 

Proof. Let s be defined by 



as per Equation (9). Using the bialgebra law we reason: 



The last equation relies on the fact that rj 0 is classical for (and T\o for 0 O ), and 

(7). 

As a consequence, strongly complementary observables always form a scaled Hopf 
algebra. Note that Theorem 4 relies on the fact that and & 0 are Frobenius alge¬ 
bras; it is certainly not the case that every bialgebra is a Hopf algebra. 

The converse to Theorem 4 does not hold: it is possible to find coherent comple¬ 
mentary observables in FHilb which are not strongly complementary. See [29] for 
a counterexample. 

The following lemma about the antipode for a strongly complementary pair was 
shown in [32]. 

Lemma 1. If are strongly complementary, and have enough classical 

points then the antipode s is: 

• self-adjoint; 

• a monoid homomorphism from 0 O to itself and similarly for 0 O ; and 

• a comonoid homomorphism from & a to itself and similarly for 
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3.2 Strong Complementarity and Phase Groups 


For complementary observables, classical points of one observable are always in¬ 
cluded in the phase group of the other observable, up to a normalizing scalar. Strong 
complementarity strengthens this property to inclusion as a subgroup. Let JC 0 be 
the set of classical points of 0 O multiplied by the scalar factor 

Theorem 5. Let (Co, 0») be strongly complementary observables and let C 0 have 
finitely many classical points. Then forms a subgroup of the phase group <P 0 of 
Co - The converse also holds when C a has ‘enough classical points’. 

Proof. By strong complementarity it straightforwardly follows that, up to a scalar, 
go applied to two classical points of 0 O yields again a classical point of C Q \ 



The unit of <P 0 is, up to a scalar, also a classical point of C a by coherence. Hence, 
.'Xf is a submonoid of <J> 0 and any finite submonoid is a subgroup. The converse 
follows from: 



together with the ‘enough classical points’ assumption. 

Recall that the exponent of a group G is the maximum order of any element of 
that group: exp(G) = max{|g| : g G G}. 

Corollary 1. For any pair of strongly complementary observables, let k = exp(.Xf). 
Then, assuming 0 O has ‘enough classical points’: 



Proof. In a finite abelian group, the order of any element divides cxp(.Xf). The 
result then follows by: 



together with the ‘enough classical points’ assumption. 
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Proposition 4. For a pair of strongly complementary observables (p is a 6' 0 - 

homomorphism for all (7) e .JC Q . Conversely, this property defines strong comple¬ 
mentarity provided 8 0 has ‘enough classical points’. 

Proof. Similar to the proof of Thm. 5. 


3.3 Classification of Strong Complementarity in FHilb 

Corollary 2. Every pair of strongly complementary observables in FHilb is of the 
following form: 

f So :: |g) H> |s> <g> |s) f <5o " |g) ® I h) H- \g + h) 

\ £o :: Is) | -> 1 \ ej :: 1 1 ->- \fb |0) 

where (G = {g,h, .. 0) is a finite Abelian group. Conversely, each such pair is 
always strongly complementary. 

Proof. By Theorem 5 it follows that the classical points of one observable (here f) 0 ) 
form a group for the multiplication of the other observable (here 5 ))). and in FHilb 
this characterises strong complementarity. 

One of the longest-standing open problems in quantum information is the charac¬ 
terisation of the number of pairwise complementary observables in a Hilbert space 
of dimension D. In all known cases this is D + 1, and the smallest unknown case 
is D = 6. We now show that in the case of strong complementarity this number is 
always 2 for D > 2. 

Theorem 6. In a Hilbert space with D > 2 the largest set of pairwise strongly com¬ 
plementary observables has size at most 2. 

Proof. Assume that both (C 0 ,C 0 ) and (C 0 ,C») are strongly complementary pairs. 
By coherence 5 a ''d 4 must be proportional to classical points of 6 ' 0 . If (C 0 , ()\) 
were to be strongly complementary observables, it is easily shown that 8 7^0 so 6 
and 4 are proportional to the same classical point. Hence, up to a non-zero scalar: 

t =1=1=4 

V* fb 9 

i.e. the identity has rank 1, which fails for D > 2. By Corollary 2 a strongly comple¬ 
mentary pair exists for any D > 2. 
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4 Mixed states, measurements, and “abstract probabilities” 


For some ket | \jf) in a Hilbert space, there are (at least) four distinct ways to represent 
|y/) as a linear map. 

It is possible to represent a ket | y/) £ H as a map y/j : C —> H, sending 1 £ C to 
1 1 j/}. We call such a map a “point” of //, because it does nothing more than picking 
out a specific element. The second map is the associated bra (y/j : H —> C. This kind 
of map is called a “co-point”. We can also regard such a map as an element of the 
dual space H*. But then, H* is just another Hilbert space, so we could just as well 
represent (y/| as a point of H*. That is, a linear map (y/|* : C —> H*. There is yet a 
fourth way to represent |y/), namely as a linear map |y/)* ://*—>■ C, sending a bra 
(01 £ H* to the inner product (01 y/) £ C. 

So, for a given ket |y/}, there are four ways to write it as points or copoints. 


\V)=> 




<Vl=* 




The difference in these four pictures is largely notational: the data they represent 
is the same. However, its a very useful piece of notation, especially when represent¬ 
ing functionals between spaces of maps. Firstly, we note that we can represent a 
map M: A —> B as a special kind of point, jf^) G A* (g)B. 


These two objects clearly represent the same data. In fact, this mapping is essen¬ 
tially the Choi-Jamiolkowski isomorphism. However instead of fixing a basis, we 
rely on the dual space A*. Thus the value on the right does not depend on a choice 
of orthonormal basis. By fixing a basis 2% = (|/)}, we can define a transposition 
map Tyg(\i)) = (i\. Then the usual C-J isomorphism is recovered as (Tsg ® 1 ) | 0m)- 
However, this does depend on a choice of basis, since Tgg does. 

In [8], the authors refer to |*f 'm) as the “name” of a map. We shall often find this 
representation more useful than the usual C-J representation, especially in instances 
involving several distinct orthonormal bases. Using the caps and cups from before, 
we can isomorphic ally relate maps and their associated names. 



The benefit of working with names of maps, as opposed to the maps themselves 
becomes clear when we start considering higher-order functionals. For a finite¬ 
dimensional Hilbert space H , let L(H) be the space of linear maps from H to itself. 
When operating on density matrices, we often want to consider maps of the form 
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<J>: L(H) —> L(K). We can either treat this as a genuine, higher-order map, or we 
can treat it as a first-order map from names to names. 

($)- 


Since, in finite dimensions, we have an isomorphism L(H) = H* 0 //, we know 
that all maps <J>: L(H) —> L(K) can be represented this way. 

In ordinary quantum theory, mixed quantum states are represented as positive 
operators and operations as completely positive maps, or CPMs. These are maps 
that take positive operators to positive operators. A general CPM can be written in 
terms of a set of linear maps { B, \ H —r K) called its Kraus maps. 

0(p) = £z?,pfi+ 

i 

As before, we can collapse the higher-order map 0 to a first- order map by trans¬ 
lating the positive operator p to its associated name. 

t 

:= P 

Then, we can encode the Kraus vectors of 0 in a map B' = £ and represent 

0 as: 



When we take the elements in Eq. (19) to be morphisms in an arbitrary f-compact 
category, this gives us an abstract definition of a completely positive map. This is 
Selinger’s representation of CPMs [33], 


(19) 


Important special cases are states where A = /, effects where B = /, and ‘pure’ 
maps, where C = I. 
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4.1 Measurements and Born vectors 


Returning to quantum mechanics, we can see how a quantum measurement would 
look in this language. A (projective) quantum measurement M 0 is a CPM that sends 
trace 1 positive operators (in this case quantum states) to trace 1 positive operators 
that are diagonal in some ONB (encoding a probability distribution of outcomes). 
Suppose we wish to measure with respect to an observable ff 0 , whose classical 
points form an ONB {|x,}}. The probability of getting the ;-th measurement outcome 
is computed using the Born rule. 

Prob(;,p) = Tr(|x/)(x,jp) 

We can write this probability distribution as a vector in the basis {|x,}}. That is, 
a vector whose ;-th entry is the probability of the i-th outcome: 

M o( p)=ETr(k)(x ! -|p)k) 

So, M defines a linear map from density matrices to probability distributions. 
Expanding this graphically, we have: 



vvy 






We are now ready to make definitions for abstract measurements and abstract 
probability distributions, which we shall call Born vector. 


Definition 16. For an observable structure 0 O , a measurement is defined as the fol¬ 
lowing map: 


m o := 



Any point |A ) 0 :1 — y X of the following form is called a Born vector , with respect 

to &o'- 



Theorem 7. In FHilb, Bom vectors for an observable ( 3 0 are precisely those vec¬ 
tors whose entries are positive and sum to 1, when written in the basis of 0 O . 


Proof Let |A) 0 be a Born vector. Its i-th coefficient in the A 0 -basis is given by: 
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We can see that these coefficients sum to 1 by using the definition of the deleting 
point: 


I 

i 



lc 


This is a completely positive map from C to C. In other words, it is a positive scalar. 
For the converse, assume |A ) 0 is a probability distribution whose i-th coefficient in 

the (3 0 -basis is /?,■ £ R + . Then, letting if/ = Yy/Pi 




Post-composing with the deleting point yields Y(y/Pi) 2 : YPi 1. 


Thus Born vectors in FHilb correspond precisely to probability distributions over 
classical points. 

We can naturally extend the definition above to points of the form |A ) 0 : / —» X ® 
... by requiring that they be Born vectors with respect to the product Frobenius 
algebra ® ® @ 0 . 

The adjoint of the measurement map m\ is a preparation operation. In FHilb, it 
takes a Born vector |A ) Q with respect to A 0 and produces a probabilistic mixture of 
the (pure) outcome states of 0 O with probabilities given by |A ) 0 . 

This leads to a simple classical vs. quantum diagrammatic paradigm that applies 
in all of the models we consider [22]: classical systems are encoded as a single wire 
and quantum systems as a double wire. The same applies to operations, and m 0 and 
m' 0 allow passage between these types. 

Note that the classical data will ‘remember’ to which observable it relates, cf. the 
encoding Yi Pi \ x i)- This is physically meaningful since, for example, when one mea¬ 
sures position the resulting value will carry specification of the length unit in which 
it is expressed. If one wishes to avoid interconversion of this ‘classical data with 
memory’, one could fix one observable, and unitarily transform the quantum data 
before measuring. Indeed, if 
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then 


U, 


U 


U* 


u 


measures the -observable but produces Z'o-data. In FHilb, all observable struc¬ 
tures are unitarily isomorphic, so any projective measurement can be obtained in 
this way. A particularly relevant example is when these unitaries are phases with 
respect the another observable structure (J 0 . 



When f)' 0 is induced by the Pauli spin-Z observable and (7 0 by the Pauli spin-A 
observable, then m 0 = is an X measurement and m % J~ is a Y measurement. Note 
however, that both produce Born vectors of outcome probabilities with respect to 
the basis. This will be useful in the next section. 


5 Example: non-locality of QM 

In 1989 Greenburger, Horne, and Zeilinger provided an analysis [34] of quantum 
theory which improves on Bell’s theorem in one crucial way. Whereas Bell demon¬ 
strated a probabilistic argument that quantum theory is incompatible with the as¬ 
sumption of local realism (i.e. quantum theory generates correlations that are too 
high for a classical local hidden variable model), the GHZ/Mermin theorem illus¬ 
trates a situation that rules out a locally realistic model possibilistically . That is, 
they described a series of experiments for which quantum theory predicts a single, 
definite outcome that is impossible under the assumption of locality. 

Here, we reproduce Mermin’s version of this argument [35] using diagrammatic 
techniques. There are two key ingredients for this translation. The first is a graphical 
notion of locality. For our purposes, it will suffice to treat locality as the fact that 
global probability can be traced down to hidden states that determine all measure¬ 
ment outcomes, since we shall show that no hidden state can ever be compatible 
with the predictions of quantum theory. Hence, there is no point in even considering 
crafting a local hidden variable representation. 

The second key ingredient is parity. The GHZ/Mermin trick for producing def¬ 
inite outcomes is to look not at individual measurement outcomes, but the overall 
parity of those outcomes, i.e. “Did the experiment produce and even or an odd num¬ 
ber of clicks?”. Considering a single outcome (click or no-click) as an element of 
the abelian group Z 2 , the parity of a set of outcomes is given by their sum in the 
group. We already saw in section 3.3 that strongly complementary observables are 
classified by abelian groups. In two dimensions, there is only one such strongly 
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complementary pair, namely the one corresponding to Z 2 . When we prepare a GHZ 
state with respect to a certain observable (e.g. spin-Z) and conduct measurements 
using a strongly complementary observable (e.g. spin-Z), we will see this ZL struc¬ 
ture arise. 

By combining these two elements (the topological picture of locality and the 
encoding of abelian groups as strongly complementary observables) we will derive 
a contradiction. Furthermore in section 5.4, we shall see how strong complementary 
was embedded in the pre-conditions of a GHZ/Mermin-style argument in the first 
place. 


5.1 A local hidden variable model 

For a particular «-party state 'F) in some theory, a local hidden variable (LHV) 
model for that state consists of: 

• a family of hidden states |A), each of which assigns to any measurement on each 
subsystem a definite outcome; and, 

• a probability distribution on these hidden states, 

which simulates the probabilities of that theory. We say that a theory is local if each 
state admits a LHV model. 

Consider three systems and four possible (compound) measurement settings, 
XXX, XYY, YXY, and YYX. A hidden state of an underlying LHV model stores 
one measurement outcome for each setting on each system: 

x y x y x Y 

|^- / ) = I ' + ^+, ) 

system 1 system 2 system 3 

We can represent this diagrammatically as follows: 


ZZZ XYY YXY YYX 



that is, we simply copy those values to each of the four measurement settings. 
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5.2 Encoding the GHZ state and computing correlations, 
diagrammatically 


To present Mermin/GHZ style argument graphically, we first show how to compute 
measurement outcomes for an n-party GHZ state graphically. This computation re¬ 
lies on a standard theorem about bialgebras, which relates a graph-theoretic property 
of diagrams to equality of bialgebra expressions. 

Definition 17. Let (^,6)^; 9) be a commutative, cocommutative bialgebra, 

and let be a diagram consisting only of ^, 5, 5 9i identity maps, and swaps. 

Then, the characteristic matrix % of £> is a matrix of natural numbers where the 
(i,y)-th entry represents the number of forward-directed paths connecting the ;-th 
input to the y'-th output. 

Example 21. The following terms have characteristic matrix 



The following terms have characteristic matrix 



Theorem 8. If two diagrams generated by the same bialgebra have the same char¬ 
acteristic matrix, they are equal as maps. 


Proof, (sketch) It is possible to show by case analysis that the three bialgebra equa¬ 
tions can be used to move all of the gray dots to the top all the white dots to the 
bottom. 


A" ?? Y* 4i 

We can furthermore show that all three of these transformations preserve the char¬ 
acteristic matrix of $>. Once this is done, we obtain a diagram in normal form: 




/10 0 \ 
000 
1 2 0 
00 1 
\0 0 3/ 
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Then, it is possible to show there is exactly one such normal form for each char¬ 
acteristic matrix. In fact, it is straightforward to read off the matrix by counting 
edges in the normal form. Since every diagram can be put into normal form using 
equations that preserve the characteristic matrix, and normal forms are in 1-to-l 
correspondence with characteristic matrices, this completes the proof. 9 

We can now apply the theorem to prove the following corollary. 

Corollary 3. The following equation holds for any connected bipartite graph with 
directions as shown. 


n n 



Proof. For the theorem on bialgebras to apply, all of the edges need to be directed 
upward. For a strongly complementary observable, the edge direction between two 
different colours can be reversed by applying the antipode S. Then, we use the fact 
that S is a Frobenius algebra endomorphism to move it down. 



We apply Theorem 8 and the spider theorem to complete the proof. 



We compute the classical probability distributions (= if/ 0 -data) for n measure¬ 
ments against arbitrary phases a, £ <f> 0 on n systems of any type in a generalised 
GHZ"- state: 



9 For a formal statement and proof of this theorem, in terms of factorisation systems see [36]. 
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Applying Corollary 3, we note that this is a probability distribution followed by 
a o-copy. 



The following is an immediate consequence. 

Theorem 9. When measuring each system of a GHZ 1 ^-state against an arbitrary an¬ 
gle then the resulting classical probability distribution over outcomes is symmetric. 

Theorem 10. The classical probability distributions for 1 0 ... 0 -measurements 
on a GHZ '^-state is: 

• uncorrelated if |£a,-) 0 is a classical point for 0 O and, 

• parity-correlated if |£0£,) e is a classical point i for f/ a (i.e. contains precisely 
those outcomes i\ 0 ... 0 i n such that the sum of group elements £4 is equal to 
i). 

Example 22. We can compute the outcome distributions for XXX, XYY, YXY, and 
YYX measurements on three qubits in a GHZ-state using the technique described 
above. First, outcome distribution |A) 0 for AAA: 



Computing correlations as in Figure (22) is symmetric in the choice of measure¬ 
ment angle for each of the systems. Thus, for the other two cases (YXY and YYX), 
we get the same result: |Bi) 0 = \B 2) 0 = |fi3) 0 - 


5.3 Deriving the contradiction 


Consider the function: 
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We have already seen that strongly complementary observables correspond to group 
algebras. That is, A defines a group algebra over the classical points of 0 O - For 
qubits there is only one choice: Z 2 . Thus, this function computes the parity (i.e. the 
Z 2 -sum) of all outcomes. 

Measuring the parity for any local hidden state we obtain: 



that is, by (17): 

i 


and hence: 



i 


and measuring the parity in quantum theory we obtain: 



that is, by the previous section: 
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and hence: 

(§) 


which yields a contradiction. 


5.4 GHZ/Mermin assumptions and the necessity of strong 
complementarity 

We shall examine two assumptions that play a key role in a GHZ/Mermin style 
non-locality argument, and show that the presence of a strongly complementary 
observable arises as a consequence. 

The original argument due to Green burger, Horne, and Zeilinger [34] and later 
simplifications [37, 35] focus on a state defined in terms of correlated (or anti¬ 
correlated) Z-spins and local spin measurements in the XY-plane. We generalise 
this assumption as follows. 

Assumption 1 (Coherence). We will use a GHZ state defined with respect to 
some observable structure Co. Measurements are all conducted within a Co- 
phase of some coherent observable C 0 . 

In FHilb, all observables containing at least one unbiased classical point, w.r.t. 
C Q , are within a C 0 -phase of a coherent observable, so we could weaken this as¬ 
sumption further. That is, if C 0 contains an unbiased classical point, we might as 
well assume it is coherent, since Assumption 1 allows us to freely choose phases. 

By Assumption 1, the correlations associated with each experiment are com¬ 
puted from this diagram: 




The second assumption is what [34] refers to as “super-classicality”. We shall 
refer to it as sharpness. 
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Assumption 2 (Sharpness). After all subsystems except one are measured, the 
final measurement outcome is fixed. 

The map is called the decoherence map for <0 o . It projects from the space of 
all quantum mixed states to the the space of classical mixtures of eigenstates of 
To assert sharpness, we require that, once two of the three systems are measured, 
the third is invariant under this map: 



Plugging the unit of 0 O in the 2nd system both for LHS and RHS, and using coher- 



and by exploiting symmetry we have: 



Hence we obtain: 
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Since o ( 1 x O O', ) is unitary it cancels. Thus our assumptions lead us to con¬ 
clude the following equation for the observable structures (<0 O , &•)'■ 



(27) 


Proposition 5. A pair (0 O , ) of coherent observables satisfying Equation (27) are 

strongly complementary. 


Proof. First, we show that equation (27) implies the following, for any pair of co¬ 
herent observables: 



The proof goes as follows: 



...which implies: 
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Equation (28) is very nearly the required equation for strong complementarity, 
but the directions are wrong. However, we can correct this by first showing the 
following equations, using coherence and (28): 






Then, we complete the proof by using the equations above to change the direc¬ 
tions of the arrows on the inside: 




Thus any coherent pair of observable structures satisfying Equation (27) is a strongly 
complementary pair. 


6 Summary and Further Reading 


In this chapter, we developed the notion of a generalised compositional theory, a 
new approach to studying quantum mechanics and constructing foil theories with 
quantum-like properties. The main building blocks for a GCT are: 

• a collection of systems A,B,C ,..., 

• a collection of primitive processes, and 

• a notion of horizontal composition (g> and vertical composition o. 

From this sparse setting, we began to add extra pieces of structure. 

• symmetry maps => “permutibility of systems” 

• dagger => “time-reversed processes” 

• duals => “map/state duality” 

This structure and its diagrammatic presentation give a rich language for talk¬ 
ing about composed processes. We then went on to define various concepts in this 
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framework, often by analogy to their Hilbert space counterparts: pure states, re¬ 
versible dynamics, quantum observables, complementarity, mixed states, and mea¬ 
surements. Using these ingredients, we worked through a complete example, fol¬ 
lowing Mermin’s illustration of a possibilistic locality violation, as predicted by 
quantum mechanics. 

The interested reader can find many papers related to, or extending the formal¬ 
ism introduced in this chapter. One example is the ZX-calculus, which is a graph¬ 
ical calculus for the interaction of the Pauli-Z and Pauli-X observable structures. 
In addition to the usual rules (complementarity, strong complementarity), several 
other rules are added, which turn out to be complete for stabiliser quantum mechan¬ 
ics [38]. This calculus has been applied to the study of measurement-based quantum 
computing [15], topological MBQC [16], and quantum protocols [39], 

The ideas developed in section 4 originated in [22]. In [40], a simplified formal¬ 
ism for interacting classical and quantum data was developed, and can be viewed as 
an abstraction of the C*-algebraic approach to the study of quantum information. 
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